Empirical Evaluation of Tree distances for Parser Evaluation

Author

  • Taraka Rama
Abstract

In this empirical study, I compare various tree distance measures, originally developed in computational biology for tree comparison, as metrics for parser evaluation. I control for the parser setting by comparing the parse trees automatically generated by a state-of-the-art parser (Charniak, 2000) with the gold-standard parse trees. The article describes two tree distance measures (RF and QD) along with their variants (GRF and GQD). It argues that the RF measure captures similar information to the standard EvalB metric (Sekine and Collins, 1997) and the tree edit distance (Zhang and Shasha, 1989) applied by Tsarfaty et al. (2011). Finally, the article provides empirical evidence by reporting high correlations between the different tree distances and EvalB scores.
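
To make the RF measure mentioned in the abstract more concrete, here is a minimal Python sketch of a Robinson-Foulds-style distance between two parse trees. It is an illustration under stated assumptions, not the article's implementation: trees are unlabeled nested tuples of tokens, each internal node is represented by the set of leaf positions it spans, and the distance is the number of spans found in one tree but not the other. The names `clusters` and `rf_distance` and the example sentence are hypothetical. This rooted, span-based variant may differ from the bipartition-based formulation used in computational biology.

```python
def clusters(tree):
    """Return the set of leaf-position sets spanned by each internal node.

    Assumption: a tree is a nested tuple; any non-tuple value is a leaf token.
    """
    found = set()

    def walk(node, start):
        # A leaf occupies exactly one position in the sentence.
        if not isinstance(node, tuple):
            return start + 1
        end = start
        for child in node:
            end = walk(child, end)
        # Record the span of positions covered by this internal node.
        found.add(frozenset(range(start, end)))
        return end

    walk(tree, 0)
    return found


def rf_distance(gold, predicted):
    """Count spans that occur in exactly one of the two trees."""
    return len(clusters(gold) ^ clusters(predicted))


# Hypothetical gold and predicted bracketings of the same six tokens.
gold = (("the", "cat"), ("sat", ("on", ("the", "mat"))))
pred = (("the", "cat"), (("sat", "on"), ("the", "mat")))

print(rf_distance(gold, pred))  # 2
```

Because each span corresponds to an unlabeled bracket, a count of mismatched spans is closely related to what bracket-based precision and recall measure, which gives some intuition for the abstract's claim that RF and EvalB capture similar information.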


Similar resources

Studying impressive parameters on the performance of Persian probabilistic context free grammar parser

In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...


Grammar & Parser Evaluation in the XTAG Project

In this paper we discuss several methods used to evaluate the XTAG parser and English grammar. We consider the methods proposed in the literature for grammar and parser evaluation, and give some empirical reasons for electing to use certain methods over others. We propose a general framework for evaluation, which is then used to evaluate the English grammar and parser developed as part of the X...


MDA Support for Constraint Checking Framework in EJB

Syntax Tree run attribute evaluator run LALR(1) parser Textual Constraints Concrete Syntax Tree Model constraints loaded [no evaluation errors] hasModelAndCst model loaded


Tree Distance in Answer Retrieval and Parser Evaluation

The use of syntactic tree-distance as a surrogate for semantic distance in an answer retrieval task is investigated. The feasibility of this is confirmed by showing that retrieval performance increases with parse quality, and an application of this to parser evaluation is discussed. Variant definitions of tree-distance involving parameters such as whole vs sub-tree, node weighting, wild-card tr...


An Evaluation of Parser Robustness for Ungrammatical Sentences

For many NLP applications that require a parser, the sentences of interest may not be well-formed. If the parser can overlook problems such as grammar mistakes and produce a parse tree that closely resembles the correct analysis for the intended sentence, we say that the parser is robust. This paper compares the performances of eight state-of-the-art dependency parsers on two domains of ungramm...



Journal title:
  • CoRR

Volume abs/1409.0314


Publication date 2014